Bayesian ranking and selection methods using hierarchical mixture models in microarray studies.

نویسندگان

  • Hisashi Noma
  • Shigeyuki Matsui
  • Takashi Omori
  • Tosiya Sato
چکیده

The main purpose of microarray studies is screening to identify differentially expressed genes as candidates for further investigation. Because of limited resources in this stage, prioritizing or ranking genes is a relevant statistical task in microarray studies. In this article, we develop 3 empirical Bayes methods for gene ranking on the basis of differential expression, using hierarchical mixture models. These methods are based on (i) minimizing mean squared errors of estimation for parameters, (ii) minimizing mean squared errors of estimation for ranks of parameters, and (iii) maximizing sensitivity in selecting prespecified numbers of differential genes, with the largest effect. Our methods incorporate the mixture structures of differential and nondifferential components in empirical Bayes models to allow information borrowing across differential genes, with separation from nuisance, nondifferential genes. The accuracy of our ranking methods is compared with that of conventional methods through simulation studies. An application to a clinical study for breast cancer is provided.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A marginal mixture model for selecting differentially expressed genes across two types of tissue samples.

Bayesian hierarchical models that characterize the distributions of (transformed) gene profiles have been proven very useful and flexible in selecting differentially expressed genes across different types of tissue samples (e.g. Lo and Gottardo, 2007). However, the marginal mean and variance of these models are assumed to be the same for different gene clusters and for different tissue types. M...

متن کامل

Diagnosis of Breast Cancer Subtypes using the Selection of Effective Genes from Microarray Data

Introduction: Early diagnosis of breast cancer and the identification of effective genes are important issues in the treatment and survival of the patients. Gene expression data obtained using DNA microarray in combination with machine learning algorithms can provide new and intelligent methods for diagnosis of breast cancer. Methods: Data on the expression of 9216 genes from 84 patients across...

متن کامل

The Family of Scale-Mixture of Skew-Normal Distributions and Its Application in Bayesian Nonlinear Regression Models

In previous studies on fitting non-linear regression models with the symmetric structure the normality is usually assumed in the analysis of data. This choice may be inappropriate when the distribution of residual terms is asymmetric. Recently, the family of scale-mixture of skew-normal distributions is the main concern of many researchers. This family includes several skewed and heavy-tailed d...

متن کامل

BAYESIAN MODELS FOR DNA MICROARRAY DATA ANALYSIS A Dissertation by KYEONG

Bayesian Models for DNA Microarray Data Analysis. (May 2004) Kyeong Eun Lee, B.A., Kyungpook National University, Korea; M.A., Seoul National University, Korea Co–Chairs of Advisory Committee: Dr. Bani K. Mallick Dr. James A. Calvin Selection of significant genes via expression patterns is important in a microarray problem. Owing to small sample size and large number of variables (genes), the s...

متن کامل

Bias-corrected Hierarchical Bayesian Classification with a Selected Subset of High-dimensional Features

Class prediction based on high-dimensional features has received a great deal of attention in many areas. For example, biologists are interested in using microarray gene expression profiles for diagnosis or prognosis of a certain disease (eg, cancer). For computational and other reasons, it is necessary to select a subset of features before fitting a statistical model, by looking at how strongl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Biostatistics

دوره 11 2  شماره 

صفحات  -

تاریخ انتشار 2010